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SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: |-V 

C--> 6 (i) APPLICANT: ANDRE E LAHAYE ^ fl\l tl ff^ H P l^J 

7 ERIC DE BUYL I— * ^" ■ * —l ■ 1 ™" 

8 PIERRE LEDOUX 

9 RENE DETROZ 

11 (ii) TITLE OF INVENTION: Xylanase, microorganisms produced it, 

12 DNA molecule, processes for preparation of this xylanase 

13 and uses thereof 
15 (iii) NUMBER OF SEQUENCES: 29 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESSEE: WILLIAN BRINKS HOFER GILSON & LIONE 

19 (B) STREET: 2000 K St., N.W., Suite 200 

20 (C) CITY: Washington 

21 (D) STATE: D.C. 

22 (E) COUNTRY: U.S.A. 

23 (F) ZIP: 20006 

25 (v) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

27 (B) COMPUTER: IBM PC compatible 

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

29 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
31 (vi) CURRENT APPLICATION DATA: 

C--> 32 (A) APPLICATION NUMBER: US/09/909,207 

C--> 33 (B) FILING DATE: 19-Jul-2001 

34 (C) CLASSIFICATION: 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: 08/470,953 

38 (B) FILING DATE: 06-JUNE-1995 

40 (Viii) ATTORNEY/AGENT INFORMATION: 

41 (A) NAME: Wilhlem F. Gadiano, Esq. 

42 (B) REGISTRATION NUMBER: 37,136 

43 (C) REFERENCE/DOCKET NUMBER: 4121-40 

45 (ix) TELECOMMUNICATION INFORMATION: 

46 (A) TELEPHONE: 202-429-0625 

47 (B) TELEFAX: (202) 293-1850 

48 (C) TELEX: 650 383-5605 
50 (2) INFORMATION FOR SEQ ID NO: 1: 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 663 base pairs 

54 (B) TYPE: nucleic acid 

55 (C) STRANDEDNESS : single 

56 (D) TOPOLOGY: linear 

58 (ii) MOLECULE TYPE: DNA (genomic) 

60 (vi) ORIGINAL SOURCE: 

61 (B) STRAIN: Bacillus 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



file://C:\Crf3\Outhold\VsrI909207.htm 



8/14/01 



Page 2 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/909 , 207 



DATE: 08/14/2001 
TIME: 13:54:09 



Input Set : N:\Crf3\RULE60\09909207.txt 
Output Set: N:\CRF3\08142001\I909207.raw 



67 CAAATCGTCA CCGACAATTC CATTGGCAAC CACGATGGCT ATGATTATGA ATTTTGGAAA 
69 GATAGCGGTG GCTCTGGGAC AATGATTCTC AATCATGGCG GTACGTTCAG TGCCCAATGG 
71 AACAATGTTA ACAACATATT ATTCCGTAAA GGTAAAAAAT TCAATGAAAC ACAAACACAC 
73 CAACAAGTTG GTAACATGTC CATAAACTAC GGAGCCAACT TCCAACCAAA TGGTAATGCG 
75 TATTTATGCG TCTATGGTTG GACTGTTGAC CCTCTTGTCG AATATTATAT TGTCGACAGT 
77 TGGGGCAACT GGCGTCCACC AGGAGCAACG CCTAAGGGGA CCATCACTGT TGATGGAGGA 
79 ACATATGATA TCTACGAGAC TCTTAGAGTC AATCAACCCT CCATTAAGGG GATTGCCACA 
81 TTTAAACAAT ATTGGAGTGT TCGAAGATCG AAACGCACGA GTGGCACGAT TTCTGTCAGC 
83 AACCACTTTA GAGCGTGGGA AAACTTAGGG ATGAATATGG GGAAAATGTA TGAAGTCGCG 
85 CTTACTGTAG AAGGCTATCA AAGTAGCGGA AGTGCTAATG TATATAGCAA TACACTAAGA 
87 ATTAACGGTA ACCCTCTCTC AACTATTAGT AATGACGAGA GCATAACTTT GGATAAAAAC 
89 AAT 

91 (2) INFORMATION FOR SEQ ID NO: 2: 

93 (i) SEQUENCE CHARACTERISTICS: 

94 (A) LENGTH: 663 base pairs 

95 (B) TYPE: nucleic acid 

96 (C) STRANDEDNESS : single 

97 (D) TOPOLOGY: linear 

99 (ii) MOLECULE TYPE: DNA (genomic) 

101 (vi) ORIGINAL SOURCE: 

102 (B) STRAIN: Bacillus 

104 (ix) FEATURE: 

105 (A) NAME/KEY: mat_peptide 

106 (B) LOCATION : 1 . .663 

108 (ix) FEATURE: 

109 (A) NAME/KEY: CDS 

110 (B) LOCATION: 1. . 663 

113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

115 CAA ATC GTC ACC GAC AAT TCC ATT GGC AAC CAC GAT GGC TAT GAT TAT 

116 Gin lie Val Thr Asp Asn Ser lie Gly Asn His Asp Gly Tyr Asp Tyr 

117 1 5 10 15 

119 GAA TTT TGG AAA GAT AGC GGT GGC TCT GGG ACA ATG ATT CTC AAT CAT 

120 Glu Phe Trp Lys Asp Ser Gly Gly Ser Gly Thr Met lie Leu Asn His 

121 20 25 30 

12 3 GGC GGT ACG TTC AGT GCC CAA TGG AAC AAT GTT AAC AAC ATA TTA TTC 

124 Gly Gly Thr Phe Ser Ala Gin Trp Asn Asn Val Asn Asn lie Leu Phe 

125 35 40 45 

127 CGT AAA GGT AAA AAA TTC AAT GAA ACA CAA ACA CAC CAA CAA GTT GGT 

128 Arg Lys Gly Lys Lys Phe Asn Glu Thr Gin Thr His Gin Gin Val Gly 

129 50 55 60 

131 AAC ATG TCC ATA AAC TAC GGA GCC AAC TTC CAA CCA AAT GGT AAT GCG 

132 Asn Met Ser lie Asn Tyr Gly Ala Asn Phe Gin Pro Asn Gly Asn Ala 

133 65 70 75 80 

135 TAT TTA TGC GTC TAT GGT TGG ACT GTT GAC CCT CTT GTC GAA TAT TAT 

136 Tyr Leu Cys Val Tyr Gly Trp Thr Val Asp Pro Leu Val Glu Tyr Tyr 

137 85 90 95 

139 ATT GTC GAC AGT TGG GGC AAC TGG CGT CCA CCA GGA GCA ACG CCT AAG 

140 lie Val Asp Ser Trp Gly Asn Trp Arg Pro Pro Gly Ala Thr Pro Lys 

141 100 105 110 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
663 



48 



96 



144 



192 



240 



288 



336 
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143 


GGG 


ACC 


ATC 


ACT 


GTT 


GAT 


GGA 


GGA 


ACA 


TAT 


GAT 


ATC 


TAC 


GAG 


ACT 


CTT 


144 


Gly Thr 


He 


Thr 


Val 


Asp 


Gly 


Gly 


Thr 


Tyr 


Asp 


He 


Tyr 


Glu 


Thr 


Leu 


145 






115 










120 










125 








147 


AGA 


GTC 


AAT 


CAA 


CCC 


TCC 


ATT 


AAG 


GGG 


ATT 


GCC 


ACA 


TTT 


AAA 


CAA 


TAT 


148 


Arg 


Val 


Asn 


Gin 


Pro 


Ser 


He 


Lys 


Gly 


He 


Ala 


Thr 


Phe 


Lys 


Gin 


Tyr 


149 




130 










135 










140 










151 


TGG 


AGT 


GTT 


CGA 


AGA 


TCG 


AAA 


CGC 


ACG 


AGT 


GGC 


ACG 


ATT 


TCT 


GTC 


AGC 


152 


Trp 


Ser 


Val 


Arg 


Arg 


Ser 


Lys 


Arg 


Thr 


Ser 


Gly 


Thr 


He 


Ser 


Val 


Ser 


153 


145 










150 










155 










160 


155 


AAC 


CAC 


TTT 


AGA 


GCG 


TGG 


GAA 


AAC 


TTA 


GGG 


ATG 


AAT 


ATG 


GGG 


AAA 


ATG 


156 


Asn 


His 


Phe 


Arg 


Ala 


Trp 


Glu 


Asn 


Leu 


Gly 


Met 


Asn 


Met 


Gly 


Lys 


Met 


157 










165 










170 










175 




159 


TAT 


GAA 


GTC 


GCG 


CTT 


ACT 


GTA 


GAA 


GGC 


TAT 


CAA 


AGT 


AGC 


GGA 


AGT 


GCT 


160 


Tyr 


Glu 


Val 


Ala 


Leu 


Thr 


Val 


Glu 


Gly 


Tyr 


Gin 


Ser 


Ser 


Gly 


Ser 


Ala 


161 








180 










185 










190 






163 


AAT 


GTA 


TAT 


AGC 


AAT 


ACA 


CTA 


AGA 


ATT 


AAC 


GGT 


AAC 


CCT 


CTC 


TCA 


ACT 


164 


Asn 


Val 


Tyr 


Ser 


Asn 


Thr 


Leu 


Arg 


He 


Asn 


Gly Asn 


Pro 


Leu 


Ser 


Thr 


165 






195 










200 










205 








167 


ATT 


AGT 


AAT 


GAC 


GAG 


AGC 


ATA 


ACT 


TTG 


GAT 


AAA 


AAC 


AAT 








168 


lie 


Ser 


Asn 


Asp 


Glu 


Ser 


He 


Thr 


Leu 


Asp 


Lys 


Asn 


Asn 








169 




210 










215 










220 










172 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 3: 
















174 




(i) 


SEQUENCE CHARACTERISTICS: 
















175 






(A) LENGTH: 221 amino 


acids 














176 






(B) TYPE: 


amino acid 


















177 






(D) TOPOLOGY: 


linear 


















179 




(ii) 


MOLECULE TYPE: 


protein 


















180 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 3; 












182 


Gin 


He 


Val 


Thr 


Asp 


Asn 


Ser 


He 


Gly 


Asn 


His 


Asp 


Gly 


Tyr 


Asp 


Tyr 


183 


1 








5 










10 










15 




185 


Glu 


Phe 


Trp 


Lys 


Asp 


Ser 


Gly 


Gly 


Ser 


Gly 


Thr 


Met 


He 


Leu 


Asn 


His 


186 








20 










25 










30 






188 


Gly Gly 


Thr 


Phe 


Ser 


Ala 


Gin 


Trp 


Asn 


Asn 


Val 


Asn 


Asn 


He 


Leu 


Phe 


189 






35 










40 










45 








191 


Arg 


Lys 


Gly 


Lys 


Lys 


Phe 


Asn 


Glu 


Thr 


Gin 


Thr 


His 


Gin 


Gin 


Val 


Gly 


192 




50 










55 










60 










194 


Asn 


Met 


Ser 


He 


Asn 


Tyr 


Gly 


Ala 


Asn 


Phe 


Gin 


Pro 


Asn 


Gly 


Asn 


Ala 


195 


65 










70 










75 










80 


197 


Tyr 


Leu 


Cys 


Val 


Tyr 


Gly 


Trp 


Thr 


Val 


Asp 


Pro 


Leu 


Val 


Glu 


Tyr 


Tyr 


198 










85 










90 










95 




200 


He 


Val 


Asp 


Ser 


Trp 


Gly 


Asn 


Trp 


Arg 


Pro 


Pro 


Gly 


Ala 


Thr 


Pro 


Lys 


201 








100 










105 










110 






203 


Gly Thr 


He 


Thr 


Val 


Asp 


Gly 


Gly 


Thr 


Tyr 


Asp 


He 


Tyr 


Glu 


Thr 


Leu 


204 






115 










120 










125 








206 


Arg 


Val 


Asn 


Gin 


Pro 


Ser 


He 


Lys 


Gly 


He 


Ala 


Thr 


Phe 


Lys 


Gin 


Tyr 


207 




130 










135 










140 










209 


Trp 


Ser 


Val 


Arg 


Arg 


Ser 


Lys 


Arg 


Thr 


Ser 


Gly 


Thr 


He 


Ser 


Val 


Ser 


210 


145 










150 










155 










160 


212 


Asn 


His 


Phe 


Arg 


Ala 


Trp 


Glu 


Asn 


Leu 


Gly 


Met 


Asn 


Met 


Gly 


Lys 


Met 



384 



432 



480 



528 



576 



624 



663 
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213 165 170 175 

215 Tyr Glu Val Ala Leu Thr Val Glu Gly Tyr Gin Ser Ser Gly Ser Ala 

216 180 185 190 

218 Asn Val Tyr Ser Asn Thr Leu Arg lie Asn Gly Asn Pro Leu Ser Thr 

219 195 200 205 

221 lie Ser Asn Asp Glu Ser lie Thr Leu Asp Lys Asn Asn 

222 210 215 220 
224 (2) INFORMATION FOR SEQ ID NO: 4: 

226. (i) SEQUENCE CHARACTERISTICS: 

227 (A) LENGTH: 744 base pairs 

228 (B) TYPE: nucleic acid 

229 (C) STRANDEDNESS : single 

230 (D) TOPOLOGY: linear 

232 (ii) MOLECULE TYPE: DNA (genomic) 

234 (vi) ORIGINAL SOURCE: 

235 (B) STRAIN: Bacillus 

239 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

241 ATGAGACAAA AGAAATTGAC GTTGATTTTA GCCTTTTTAG TTTGTTTTGC ACTAACCTTA 60 

24 3 CCTGCAGAAA TAATTCAGGC ACAAATCGTC ACCGACAATT CCATTGGCAA CCACGATGGC 120 

245 TATGATTATG AATTTTGGAA AGATAGCGGT GGCTCTGGGA CAATGATTCT CAATCATGGC 180 

247 GGTACGTTCA GTGCCCAATG GAACAATGTT AACAACATAT TATTCCGTAA AGGTAAAAAA 24 0 

249 TTCAATGAAA CACAAACACA CCAACAAGTT GGTAACATGT CCATAAACTA CGGAGCCAAC 300 

251 TTCCAACCAA ATGGTAATGC GTATTTATGC GTCTATGGTT GGACTGTTGA CCCTCTTGTC 360 

253 GAATATTATA TTGTCGACAG TTGGGGCAAC TGGCGTCCAC CAGGAGCAAC GCCTAAGGGG 420 

255 ACCATCACTG TTGATGGAGG AACATATGAT ATCTACGAGA CTCTTAGAGT CAATCAACCC 4 80 

257 TCCATTAAGG GGATTGCCAC ATTTAAACAA TATTGGAGTG TTCGAAGATC GAAACGCACG 540 

259 AGTGGCACGA TTTCTGTCAG CAACCACTTT AGAGCGTGGG AAAACTTAGG GATGAATATG 600 

261 GGGAAAATGT ATGAAGTCGC GCTTACTGTA GAAGGCTATC AAAGTAGCGG AAGTGCTAAT 660 

263 GTATATAGCA ATACACTAAG AATTAACGGT AACCCTCTCT CAACTATTAG TAATGACGAG 720 

265 AGCATAACTT TGGATAAAAA CAAT 744 
267 (2) INFORMATION FOR SEQ ID NO : 5: 



269 (i) SEQUENCE CHARACTERISTICS: 

270 (A) LENGTH: 744 base pairs 

271 (B) TYPE: nucleic acid 

272 (C) STRANDEDNESS: single 

273 (D) TOPOLOGY: linear 

275 (ii) MOLECULE TYPE: DNA (genomic) 

277 (vi) ORIGINAL SOURCE: 

278 (B) STRAIN: Bacillus 

280 (ix) FEATURE: 

281 (A) NAME/KEY: CDS 

282 (B) LOCATION :1. .744 

284 (ix) FEATURE: 

285 (A) NAME/KEY: mat_peptide 

286 (B) LOCATION: 82. .744 

288 (ix) FEATURE: 

289 (A) NAME/KEY: sig_peptide 

290 (B) LOCATION : 1 . .81 

293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
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295 


ATG 


AGA 


CAA 


AAG 


AAA 


TTG 


ACG 


TTG 


ATT 


TTA 


GCC 


TTT 


TTA 


GTT 


TGT 


TTT 


48 


296 


Met 


Arg 


Gin 


Lys 


Lys 


Leu 


Thr 


Leu 


He 


Leu 


Ala 


Phe 


Leu 


Val 


Cys 


Phe 




297 


-27 




-25 










-20 










-15 










299 


GCA 


CTA 


ACC 


TTA 


CCT 


GCA 


GAA 


ATA 


ATT 


CAG 


GCA 


CAA 


ATC 


GTC 


ACC 


GAC 


96 


300 


Ala 


Leu 


Thr 


Leu 


Pro 


Ala 


Glu 


He 


He 


Gin 


Ala 


Gin 


He 


Val 


Thr 


Asp 




301 




-10 










-5 










1 








5 




303 


AAT 


TCC 


ATT 


GGC 


AAC 


CAC 


GAT 


GGC 


TAT 


GAT 


TAT 


GAA 


TTT 


TGG 


AAA 


GAT 


144 


304 


Asn 


Ser 


He 


Gly 


Asn 


His 


Asp 


Gly 


Tyr 


Asp 


Tyr 


Glu 


Phe 


Trp 


Lys 


Asp 




305 










10 










15 










20 






307 


AGC 


GGT 


GGC 


TCT 


GGG 


ACA 


ATG 


ATT 


CTC 


AAT 


CAT 


GGC 


GGT 


ACG 


TTC 


AGT 


192 


308 


Ser 


Gly 


Gly 


Ser 


Gly 


Thr 


Met 


He 


Leu 


Asn 


His 


Gly Gly 


Thr 


Phe 


Ser 




309 








25 










30 










35 








311 


GCC 


CAA 


TGG 


AAC 


AAT 


GTT 


AAC 


AAC 


ATA 


TTA 


TTC 


CGT 


AAA 


GGT 


AAA 


AAA 


240 


312 


Ala 


Gin 


Trp 


Asn 


Asn 


Val 


Asn 


Asn 


He 


Leu 


Phe 


Arg 


Lys 


Gly 


Lys 


Lys 




313 






40 










45 










50 










315 


TTC 


AAT 


GAA 


ACA 


CAA 


ACA 


CAC 


CAA 


CAA 


GTT 


GGT 


AAC 


ATG 


TCC 


ATA 


AAC 


288 


316 


Phe 


Asn 


Glu 


Thr 


Gin 


Thr 


His 


Gin 


Gin 


Val 


Gly 


Asn 


Met 


Ser 


lie 


Asn 




317 




55 










60 










65 












319 


TAC 


GGA 


GCC 


AAC 


TTC 


CAA 


CCA 


AAT 


GGT 


AAT 


GCG 


TAT 


TTA 


TGC 


GTC 


TAT 


336 


320 


Tyr 


Gly 


Ala 


Asn 


Phe 


Gin 


Pro 


Asn 


Gly Asn 


Ala 


Tyr 


Leu 


Cys 


Val 


Tyr 




321 


70 










75 










80 










85 




323 


GGT 


TGG 


ACT 


GTT 


GAC 


CCT 


CTT 


GTC 


GAA 


TAT 


TAT 


ATT 


GTC 


GAC 


AGT 


TGG 


384 


324 


Gly 


Trp 


Thr 


Val 


Asp 


Pro 


Leu 


Val 


Glu 


Tyr 


Tyr 


He 


Val 


Asp 


Ser 


Trp 




325 










90 










95 










100 






327 


GGC 


AAC 


TGG 


CGT 


CCA 


CCA 


GGA 


GCA 


ACG 


CCT 


AAG 


GGG 


ACC 


ATC 


ACT 


GTT 


432 


328 


Gly 


Asn 


Trp 


Arg 


Pro 


Pro 


Gly 


Ala 


Thr 


Pro 


Lys 


Gly 


Thr 


He 


Thr 


Val 




329 








105 










110 










115 








331 


GAT 


GGA 


GGA 


ACA 


TAT 


GAT 


ATC 


TAC 


GAG 


ACT 


CTT 


AGA 


GTC 


AAT 


CAA 


CCC 


480 


332 


Asp 


Gly 


Gly 


Thr 


Tyr 


Asp 


He 


Tyr 


Glu 


Thr 


Leu 


Arg 


Val 


Asn 


Gin 


Pro 




333 






120 










125 










130 










335 


TCC 


ATT 


AAG 


GGG 


ATT 


GCC 


ACA 


TTT 


AAA 


CAA 


TAT 


TGG 


AGT 


GTT 


CGA 


AGA 


528 


336 


Ser 


He 


Lys 


Gly 


He 


Ala 


Thr 


Phe 


Lys 


Gin 


Tyr 


Trp 


Ser 


Val 


Arg 


Arg 




337 




135 










140 










145 












339 


TCG 


AAA 


CGC 


ACG 


AGT 


GGC 


ACG 


ATT 


TCT 


GTC 


AGC 


AAC 


CAC 


TTT 


AGA 


GCG 


576 


340 


Ser 


Lys 


Arg 


Thr 


Ser 


Gly 


Thr 


He 


Ser 


Val 


Ser 


Asn 


His 


Phe 


Arg 


Ala 




341 


150 










155 










160 










165 




343 


TGG 


GAA 


AAC 


TTA 


GGG 


ATG 


AAT 


ATG 


GGG 


AAA 


ATG 


TAT 


GAA 


GTC 


GCG 


CTT 


624 


344 


Trp 


Glu 


Asn 


Leu 


Gly 


Met 


Asn 


Met 


Gly Lys 


Met 


Tyr 


Glu 


Val 


Ala 


Leu 




345 










170 










175 










180 






347 


ACT 


GTA 


GAA 


GGC 


TAT 


CAA 


AGT 


AGC 


GGA 


AGT 


GCT 


AAT 


GTA 


TAT 


AGC 


AAT 


672 


348 


Thr 


Val 


Glu 


Gly 


Tyr 


Gin 


Ser 


Ser 


Gly 


Ser 


Ala 


Asn 


Val 


Tyr 


Ser 


Asn 




349 








185 










190 










195 








351 


ACA 


CTA 


AGA 


ATT 


AAC 


GGT 


AAC 


CCT 


CTC 


TCA 


ACT 


ATT 


AGT 


AAT 


GAC 


GAG 


720 


352 


Thr 


Leu 


Arg 


He 


Asn 


Gly 


Asn 


Pro 


Leu 


Ser 


Thr 


He 


Ser 


Asn 


Asp 


Glu 




353 






200 










205 










210 










355 


AGC 


ATA 


ACT 


TTG 


GAT 


AAA 


AAC 


AAT 


















744 


356 


Ser 


He 


Thr 


Leu 


Asp 


Lys 


Asn 


Asn 




















357 




215 










220 






















360 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: € 
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